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ABSTRACT 


The most effective results in the processing and recognition of images are achieved through the development of 
information technology, the introduction of new technologies. The TensorFlow library capabilities in the python 
programming environment are huge in getting effective results. It describes how to use open data sets to recognize Arabic 
graphics and to form datasets for additional letters for old Uzbek and Farsi text letters. Examples, advantages and 
analysis of the results obtained by the TensorFlow platform to perform calculations with python are given. The efficacy 
and mode of use of the convolutional neural network are described. Results were obtained from the use of open data sets 
via www.kaggle.com. The most effective methods of recognizing the given Arabic text are used, and only the results 
obtained are described, without the algorithms given in the references. CNN created a model for the data set letters, based 


on which the results were 90% recognizable. 
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INTRODUCTION 


There are many ready-made libraries for working with neural networks for the Python programming language. The use of 
these libraries is designed to significantly simplify the task of developing neural networks. TensorFlow [2] is a very 
popular library developed by Google. The documentation for this library is well developed. There are many articles on the 
internet and examples of neural networks for this library. Here, graphs are used to represent a neural network, and 
multidimensional rows, tensors, are used to store data. Like other libraries, it has integration with the NumPy library. Both 
the CPU and the GPU have the ability to perform calculations. The Keras library is, more precisely, the basis for the 
creation of neural networks. In addition to Keras TensorFlow and Theano, Keras can use one of these two libraries to 
perform calculations. Of all the libraries, this is the most convenient and easy to understand. Here new layers are added 
using only one function. Good documentation and lots of ready-made examples - all of which are a big advantage for 
Keras. The development of these capabilities has the great advantage of recognizing manuscript sources with complex 
forms. Arabic graphics have a complex shape, and the fact that its 28 letters appear in different forms in the middle and at 


the end of a word, and that they form a different shape due to the combination of letters, causes some problems in 
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recognition. In solving these problems, we analyze the results using the following segmentation algorithms and neural 


networks in recognition. 
Methodology of Segmentation of the Arabic Text 


We will be able to make effective use of neural network capabilities and analyze the results in a python programming 
environment. We use horizontal segmentation to recognize each line and vertical segmentation to separate words and 


letters. 
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Figure 1: Segments in Recognition. 





How to add text to letters in Arabic, these three segmentations are done using word segmentation. Line 
segmentation approaches: Projection-based approach; when assembled horizontally, this is called the horizontal projection 
along the vertical axis for each x value, and this is called the vertical prol8]. Smearing approach; text strings calculated 
within a predetermined boundary between the white space along the horizontal line are bounded by copies [6, 10, 11]. 
Grouping method; it is built by grouping adjacent Arabic text such as "5, ~»", entries such as "4, 5". The problem is that 
projection-based words and characters use a simple constraint tool during character segmentation. It is based on certain 
cognitive criteria such as similarity, continuity, and closeness [7, 10]. Word and Character Segmentation Approaches: 


There are four main approaches to linking characters in an Arabic word [1]. 
e Assume that the input is already divided into characters (there is no need for character segmentation). 


e Divide the introductory words into smaller primitive parts than the character, and then group each elementary 


group into characters as they are identified. 
e Divide words into characters. This is the most difficult approach in the language of nature. 
e Recognize keywords without segmentation in general. 


Histogram-Based Algorithms Several algorithms use histogram-based methods in different languages for OCR. 
None of these algorithms solve all the problems associated with OCR in Arabic. In Arabic, only the division into linear and 


connected parts is done, and the most important thing in Arabic is to separate these characters, and this is ignored [20]. 


Proposed Algorithm This document belongs to a segmentation order that accepts an Arabic text image and outputs 
separated characters. It consists of three stages. The first is responsible for identifying and separating lines in the text. The 
segmented lines then skip the second step, which is to retrieve words from the text lines. Finally, the third stage takes these 
words and creates a characteristic appearance of each word. The proposed algorithm considers the nature of the character 
between the characters and partially overlaps .The previous step is used to detect and correct the curvature of the scanned 


text image. The algorithm in [15] is used to perform the angle of rotation correction. 
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Line splitting: Line segmentation is performed using the Image Axis Profile method, which calculates the 


horizontal axis profile for a binary text image [10]. 








Figure 2: Separate Words from a Line. 


Figure 2 shows an image of an Arabic text line and a corresponding vertical profile. The text line is divided into 
parts that are connected from left to right. These connected parts are grouped into the appropriate word. Each word is an 


introductory image of the character segmentation phase. 
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Figure 3: Characterization Procedure. 
Image highlighting is used to identify a pre-configured data set using a model taught in CNN. In recognizing these 


manuscripts, the data set used an open data set available at https://www.kaggle.com for the Urdu letter, and the results were 


obtained. 
Procedure of Using CNN 


A distinctive feature of the CNN neural network is that first-level neurons are arranged in a special structure: in the first 
layers, neurons are divided into images of a certain size (sometimes called cards), and different cards within the same layer 
represent neurons that respond to different image characteristics. Corresponds to different species [3]. There are two types 
of calculating the activation of the next layer in CNNs. The first type: activation of neurons at the next level 1s considered 
as a linear combination of activation of neurons at the previous level, and the weight of these linear activities depends only 
on the relative position of neurons, types of neurons, but they do not depend on the state of a particular neuron on the map. 
The second type: the activation of neurons at the next level simply replicates the activity of neurons at the previous level, 


but the activation of neighbouring neurons because they are replaced by their maximum or average values is called the 
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process of reduction. This structure makes it very convenient for CNN to work with images because it allows, for example, 
if two images differ by a small shift, the network achieves a very similar result at the same time as in normal, non- 
convulsive neural networks. This is not the right result. In addition, the number of parameters in convulsive artificial neural 
networks is smaller than the number of neurons. Typical neural networks for the same number of neurons can have 
hundreds of billions of parameters, and it is impossible to include training kits to train such a large number of parameters, 
there are aggregate neural networks with so many neurons can be taught in samples and this ensures a high result in the 


recognition of manuscript texts [2]. 


A review of ready-made solutions for the creation of neural networks, the first CNN (Convolutional Neural 
Network) neural network for character identification, was introduced in 1998 by French researcher Yann LeCun [1]. It’s 


called LeNet. The structure of this network is shown in the following figure. 
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Figure 4: The Structure of the LeNet Neural Network. 
The general calculation procedure of CNN was chosen as follows: 
CNN=40:3x3 pool 2x2, CNN=60:3x3, pool=2x2, Istm=200, drout=0.5. ctc loss 


Recognition software developed in python. Here is the identification procedure based on the code. Here 40 filters 
and a 3x3 pointer are the core function and we use one pool = 2x2. We use 60 filters 3x3 core function, pool = 2x2 1.e. 2 


CNN. To get small input settings. 


The CNN structure serves to obtain glyphs of given words from a series of segmented images. The series image is 
large in size and can be swiped 40 times using the 3x3 core function and the pool 2x2. In the next CNN section, a swipe 
Operation is performed with 60 3x3 core functions and pool = 2x2. The CNN neural network output class is the number of 
rows. In an interconnected neural network, a multilayer perceptron neural network was used and classified using inner 


layers with 16 neurons and a relay activation function. 


On CNN, input neurons are numerical representations of a series of images: 
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Figure 5: Image Processing in Neurons. 


If the image size is large, the number of input neurons will increase accordingly. For example, in an image 
measuring 28x28 pixels, the number of input neurons is 784, which is 13,002 internal neurons in a multilayer perceptron 
neuron. At CNN, we can use a kernel matrix to extract important features of an image, and we can have an active matrix 
using a pool swipe operation. Using this active matrix, we can store the base parameters of the classes of this shape. This 


led to a contraction of the input neurons given to us. An example of a symbol 1s: 





Figure 6. Different Aspects of Each Character. 


We can distinguish its vertical, horizontal, arc shapes using a CNN core matrix. 





Figure 7: Separating the Horizontal Parameters of the Number. 


The nucleus matrix achieves a drastic reduction in the values of access to the neuron by isolating the main parts. 
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Figure 8: Separation of the Main Features by 2 Core Switches. 


A series image is inserted into the CNN input layer. 


(f *g)[m,n]= >) flm—k,n-1]* gk, 1] 
. (1) 


Here f is the input matrix and g is the core matrix. 
A new active matrix is formed by a kernel (1) of the given nxn size: a pool is created in the active matrix. 


The results from the row segmentation separate and classify the glyphs that are the basis for each row, that is, the 


general forms of word and letter forms. 
The Result 
from tkinter import * 
from PIL import ImageTk, Image 
from tkinter import filedialog,messagebox 
from ttk import Frame, Label 
import configparser 
import cv2 
import tempfile 
import numpy as np 


import os 
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import subprocess 
libraries are used using tensorflov platform and 


hdf5 file capabilities were used. 
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Figure 9. Using Neural Network in Model Training Window. 


J is stored in the number extension. With Class Attrs () we can use the functions and procedures available in the 


library. 
class Attrs(): 
def _ init__(self): 
self.files = glob_all([os.path.join(this_dir, "data", "train", "*.png")]) 
self.seed = 24 
self.backend = "TensorFlow" 
self.network = "CNN=40:3x3,pool=2x2,CNN=60:3x3,pool=2x2,lstm=200,dropout=0.5" 
self.checkpoint_frequency = 1000 
self.max_iters = 1000 
self.stats_size = 100 
self.no_skip_invalid_gt = False 
self.no_progress_bars = True 
self.output_dir = os.path.join(this_dir, "test_models") 
self. whitelist_files = [] 
self. whitelist = [] 


self.gradient_clipping_mode = "AUTO" 
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self.n_augmentations = 0 


en 


self.fuzzy_ctc_library_path = 
self.num_inter_threads = 0 
self.num_intra_threads = 0 
self.text_regularization = ["extended"] 


self.text_normalization = "NFC" 


| ae | = = _ ye Cosgate anement i) Orxpeite © _HS Bauer Bce 


ee ge A 0 x 
ee Fae 


ara eo, Ses a Mies clGcld pie 3 wit es 1% 
sae Hm FIILK SLL ota) Jed dtl ot ede! OS! OS 


¢ 4 
a ae os JY Susi ghgh 43) Sisld ype jy’ id Ale Ae H rte 
n-* eo ° 6 


‘ 1 tet « NS &bS ost o@ . ‘ le ! <) s/* ry 
—_ OF SUF RE USS De Cth 5) PARA «oh etna eu Ba 


~ oe 
FE> Cpe ot S 8 1.Rasm ochish 
IF J 
PT 4 é at wy ¢ toi ‘ le att «it sy 
BCMA alte 9D Ft ee! Os!) SUID g) Cel oli 225 
Srehel « N é. ve oN ,ehe ’’ ) 4] as ie 
AAK SG SIF 0 PHO HN AW SIF DPAIF  . O H 
Let ié sa af S : tod ' 
ple Se ae LF ORS. Ce 2 ee Sh 
re 4 ‘ . Null j . Nia . ._- . 
sp rs S Ji auake Cle She 


ot Cheah Ce! Cte Ge Des Ft ee ph! 2.Fuzzy algoritmi 
S35: 3g) ABs UY mle Ol OAL es 4855 ail ps tng 
pays Chea! a glst Glade! Als aighyl slY yyta yl” ad: Ale 
Sg Ge A aS “Cua!” AS Slat oes pglt dulalp siagha gi y 
Rs PK, CP OFF Pe ee | 
noes 9 WIP? DFA CPT CEE Be 
= = - 3S tatsry 
Ls fer Usr.- ASF ££C> ‘a ae 3,Segmentatsiya 
arn Se = 
CFV ug LE. 90 
oso 2 “Se —_ 4 . 
La Dt eo) ee ee rem Oe oo Nianat 
_—s on, oe == ?f 
SIC Fa los a ee 4.Tanish 


















Bee’ i Ea. BO ae lO) el ee 





Figure 10. Neural Network Usage Interface in Image Recognition. 


Based on the module created by leNet, it is possible to create models by teaching images using ready-made 
modules to simplify calculations and achieve results. Using the generated model, it is possible to obtain familiar results 
using the necessary functions in image recognition. Several neural models are available in the TensorFlow library and 
provide opportunities to perform their comparative analyzes. Open data sets are available at https://www.kaggle.com. They 
are data sets for letters, words, lines, magazines, books, and with the help of our software we used the following data sets 


to recognize Arabic text. 


Table 1: Results Obtained in the Acquaintance 


Urdu Handwritten Characters Dataset JyauliaAvlenvensan 62% 
https://www.kaggle.com 
Urdu Handwritten Characters Dataset ivi anmadrade Khadieade 60% 
https://www.kaggle.com 
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We have expanded these data sets for the old Uzbek script. Up to 4,000 sample manuscripts were taken for each 
letter. There are 28 letters in Arabic and 32 letters in Old Uzbek. We added 4 sides to the data set. That is, we found and 


placed up to 2,000 samples for each letter. 


Table 2: Results from the Extended Data Set 


Data Set Kitob Nomi 


Olb Uzbek Handwritten Characters Dataset 
(https://www.kaggle.com/mloey l/ahcd1) ia dcsiiaanacias aia 





Arabic Handwritten Characters Dataset ’ 
CONCLUSIONS AND RECOMMENDATIONS 


Results were obtained using a TensorFlow library in a python programming environment to recognize Arabic graphics. 
TensorFlow library made it possible to facilitate neural network accounting problems. In programming, python has 
program code close to human language, supports opencv capabilities, allows you to use ready-made libraries using the 
TensorFlow platform. It is possible to expand a given graph by expanding open-type data sets. The results of recognizing 
lithographs in Arabic graphics with 90% accuracy were obtained. When data sets are formed separately for each 
calligrapher in recognition with an accuracy of up to 90 percent, we can achieve efficiency in recognizing his book by 
selecting the calligrapher as we choose the language in finereadear because each calligraphy has its own style. The 
software can be used at the Imam Bukhari International Research Center to print more than 70,000 Arabic-language 
sources in Old Uzbek. The development of neural networks, 1.e. the introduction of new models, the introduction of special 
libraries in the python programming language, the availability of open data sets, These capabilities serve to easily and 


easily process the problems facing programming and bring accurate results. 
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